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Figure 1 

Different hint extension instructions. 



5 11 8 8 
M M M M ► 



Ld_bht_op 


bht_index 


br_mask 


bht_bits 



100 



-M- 



27 



NOP 



Unused 



109 



Keet2ofl3 Docket P09-96-134 by B.^Hroy 



tgore 2 

Overview of the Processor operation with Hint processor 



L2 Cache 
Directory 
211 



(DL2 address) 



(IL2 address) 




L2 Cache 
212 



IL1 cache miss 
Logic 



Hint Instruction Cache 
201 



(Program 
instructions) 



Instruction 
Decode 
Unit (IDU) 



(Fetch group) 



208 



(Dispatch group) 



(Hint 
instruction) 



-D-Dir — 












223 







DL1 cache miss 
Logic 



D-Cache 



221 



(Hint instruction) 



(Hint instruction) 



Branch detection, 
Branch prediction 
and fetch group 
formation logic 



BHT 
204 



207 



206 



Hint Processor 
(See Figure 2A) 



211 



Instruction Sequencing Unit 



209 



Global Completion Table (GCT) 



Valid 
bit 



Address of the 
dispatch group 



Dispatched group 
of instruction 





Issue 




Register 




Queue 




File 








(R0-R31) < 




216 







I 



(store data) 



(load data) 



Branch 


217A 


Fixed-point 


217B 


Execution 




Execution 




Unit 




Unit 





217C 



Floating-point 

Execution 

Unit 



"2T7D 



Instruction Execution Units (IEU) 



Load/Store 

Execution 

Unit 



(load/store address) 
217 



Sl^Bofl3 Docket P09-96-134 



by B.Sii 



glire 2A The Hint Processor 
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Overview of the Instruction Fetch Unit with Hint Processor 
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Figure 4 
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Figure 6 
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Figure 7 
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Figure 8 
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Start program execution 

| 804 
Set IFAR = first instruction address in the program 

806 h 

Read the IL1 directory entry indexed by IFAR bits 48:58 
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Goto Fig. 9, B 



813 

Does the IFAR bits 0:42 matches the "Address of the 
first instruction" field of the L2 directory entry? 

1 No (L2 miss) 
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If "type bit" field of the L2 directory 
entry is D castout the L2 entry to the 
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Assign "IL2_hint_wr_addr" as the L2 index equal to the concatenation 
of bits 43:47 of the "Address of the first instruction" field of the I-cache 
directory entry indexed by ILl_hint_wr_addr with IFAR bits 48:58. 

Access the L2 directory entry indexed by IL2_hint_wr__addr 

i 818 
Is the L2 entry valid, and does its "Address of the first instruction" field match bits 0:47 
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Load the IL1 entry with 8 program instructions from the "Eight Program Instructions" field and the hint instr 
of the entry in L2 cache indexed by IFAR bits 43:58 

Forward the hint instruction to the hint processor and execute them (see Figure 13) 
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Figure 9 



f^t\\ From Fi^nrp 8. 



901 

Fetch the IL1 cache line into "Eight Program Instructions" register, and the associated hint instructions 
in the Hint Instruction register. 

Access the BHT entry indexed by the IFAR bits 48:58, and fetch its BHT prediction bits into the 
"Eight Predictions" register. I 

I 903 
Use the IFAR bits 59:61 to locate "first instruction" in the "Eight Program Instructions" register. 
(Instructions before th~e"*^rst instruction", if any, will be ignored). - - - 

I 904 
Is there any branch instruction in the "Eight Program Instructions" register at or after the "first instruction"? 

Yes . I N ° 906 

Designate a "fetch group" as the instructions from the "first instruction" to the end of the register. 

Set-'Predicted_IFAR"-to the address of the next sequential instruction after the -^fetch group"^ 
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In the hint processor, fill the BHT Hint register with the following: bits 0:4 with "ld_bht_op", bits 5:15 with 
IFAR bits 48:58, bits 16:23 with a 8-bit "branch mask" field containing a 1 in the positions where there is a 
branch and 0 in other positions, bits 24:31 with the 8-bit BHT prediction. 

In the hint processor, store IFAR bits 48:58 in the BHT Hint Write Entry register 

Store the content of the BHT Hint register in the Hint Instruction feld in the IL1 Cache at the location 
in the BHT Hint Write Entry register 
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Is any branch indicated in the "Eight Predictions" register to be an unconditional branch or a conditional 
branch predicted taken? ^ 0 1 
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branch as the "last instruction". fetched. Designate the last instruction in the "Eight 
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921 

All instructions between the "first instruction" and the "last instruction" forms the "fetch group". 
For each branch in fetch group, obtain an invalid entry in the Branch Information Queue (BIQ), 
set its valid bit to 1 state and put: 

the address of the branch in the "Address of the branch" field, 

the branch target address in the "Predicted address" field if the branch is predicted taken or the 

sequential address in the "Predicted address" field if the branch is predicted not-taken, 

if the branch is at position "n", store the n-th bit in the "Eight Predictions" register in the 
"BHT bit" field. 

Place the branch in the branch issue queue for its subsequent execution. 

926 ^ 

Forward the fetch group to Instruction Decode Unit (IDU), see Figure 11. 

Goto Figure 10, C 
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Figure 10 

Next IFAR determination logic 



From Figure 9. 
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Set BHT_write_addr register to the 
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1021 I 
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Figure 11 

Instruction Decode and Dispatch 



( ENTER ) 

i 1101 
Is there a fetch group forwarded by IFU in this cycle? 

Yes N ° 



1102 

Form one or more dispatch groups from the fetch group received by the IDU from IFU following 
the rules of dispatch group formation (not more than five instructions per group, at most one 
branch in the dispatch group, fifth slot in the dispatch group is reserved for branch instructions 
only and if there is not enough instructions to fill all the slots in the dispatch group, insert NOPs). 

1103 | 

Obtain an invalid entry in the Global Completion Table (GCT) and fill its field with the information 
for the dispatch group and validate the entry 

Place each of the instructions in the dispatch group in the issue queue for subsequent 
execution (see figure 12) ^ 

( EXIT> 
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Figure 12 

instruction issue and instruction execution 

( ENTER) 



f 1201 
Is there any valid instruction in the issue queue for which all the operands are known? 

I I Yes 

1202 | N ° | 

Wait for one cycle. 1203 
I Forward the instruction to its proper execution unit (branch instruction 

1 to the branch execution unit, load/store instruction to the load/store 

execution unit, fixed-point instruction to the fixed-point execution unit, 
floating-point instruction to the floating-point execution unit). 
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Figure 13 

Execution in the Hint Processor for each IFAR cycle 
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"branch mask" field)) OR ("bhtjbits" field in 
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(All the logical operation are performed bit by bit) 

Store the 8-bit "new_bht" value in the BHT entry 1305 
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